Rank | Count | Beginning |
---|---|---|
6 | 13463 | وقال |
60 | 6187 | وفي |
7 | 5850 | وكان |
1 | 5741 | كما |
8 | 5662 | وقد |
17 | 4285 | من |
56 | 3835 | ومن |
203 | 3481 | وكانت |
99 | 3187 | في |
43 | 3181 | واضاف |
111 | 3109 | وقالت |
101 | 2863 | لكن |
33 | 2755 | وأضاف |
184 | 2545 | أما |
537 | 2484 | - |
59 | 1828 | ولم |
764 | 1673 | وأكد |
153 | 1661 | هذا |
270 | 1543 | ولكن |
318 | 1526 | لا |
147 | 1513 | إن |
116 | 1433 | يذكر |
5 | 1421 | وأشار |
626 | 1404 | ولا |
291 | 1376 | ويقول |
720 | 1351 | لقد |
154 | 1311 | فقد |
467 | 1305 | وذكر |
863 | 1283 | هل |
119 | 1211 | أعلن |
In the next four subsections show the most frequent sentence beginnings consisting of N words, N=1, 2, 3, 4. In this subsection we start with N=1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
Especially for N=1, we only need a small corpus to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.1 Most Frequent Sentence Endings I
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV